A Novel Algorithm for Mining Fuzzy High Utility Itemsets

نویسندگان

  • Cheng-Ping Lai
  • Pau-Choo Chung
  • Vincent S. Tseng
چکیده

Utility mining is to find the itemsets in a transaction database with high utility values like profits. Although a number of algorithms on high utility mining have been proposed, they did not reflect the fuzzy degree of quantity and profit level for mined high utility itemsets, which are essential for decision making in various applications like stock control and sales analysis. In this paper, we explore to apply fuzzy sets theory to the utility mining problem and propose a novel method, namely FHUI (Fuzzy High Utility Itemsets)-Mine, for mining fuzzy high utility itemsets. In addition to reflecting the fuzzy degree for quantity and profit regions of high utility itemsets, FHUI-Mine also provides a fuzzy threshold range that may include itemsets with profits slightly less than the designated threshold value. To prove the feasibility of FHUI-Mine, it was compared with the well-known Two-Phase algorithm through experimental evaluation. The results show that FHUI-Mine delivers higher mining capability since it can not only mine all high utility itemsets found by Two-Phase algorithm but also discover additional itemsets that are potentially high utility ones.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Algorithm for High Average-utility Itemset Mining

High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...

متن کامل

A Fuzzy Algorithm for Mining High Utility Rare Itemsets – FHURI

Classical frequent itemset mining identifies frequent itemsets in transaction databases using only frequency of item occurrences, without considering utility of items. In many real world situations, utility of itemsets are based upon user’s perspective such as cost, profit or revenue and are of significant importance. Utility mining considers using utility factors in data mining tasks. Utility-...

متن کامل

Data sanitization in association rule mining based on impact factor

Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...

متن کامل

Temporal Fuzzy Utility Mining with Upper-Bound

Fuzzy utility mining reflects fuzzy degrees of quantities and profits for high utility itemsets. In generally, transaction time is also concerned, and not all products sold are always on the shelf. Thus, in this paper we present an effective framework, which considers the transaction period of each product from the first transaction it appears to the last transaction in the whole database for m...

متن کامل

An efficient algorithm for mining temporal high utility itemsets from data streams

Utility of an itemset is considered as the value of this itemset, and utility mining aims at identifying the itemsets with high utilities. The temporal high utility itemsets are the itemsets whose support is larger than a pre-specified threshold in current time window of the data stream. Discovery of temporal high utility itemsets is an important process for mining interesting patterns like ass...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010